Multidimensional content eXploration
Authors
Abstract
Content Management Systems (CMS) store enterprise data such as insurance claims, insurance policies, legal documents, patent applications, or archival data, as in the case of digital libraries. Search over content allows for information retrieval, but gives users little insight into the data as a whole. A more analytical view is needed, through aggregations, groupings, trends, pivot tables, charts, and so on. Multidimensional Content eXploration (MCX) is about effectively analyzing and exploring large amounts of content by combining keyword search with OLAP-style aggregation, navigation, and reporting. We focus on unstructured data or, more generally, on documents or content with limited metadata, as is typically encountered in CMS. We formally present how CMS content and metadata should be organized in a well-defined multidimensional structure, so that sophisticated queries can be expressed and evaluated. The CMS metadata provide traditional OLAP static dimensions, which are combined with dynamic dimensions discovered from the analyzed keyword search result, as well as measures for document scores based on the link structure between the documents. In addition, we provide means for multidimensional content exploration through traditional OLAP roll-up and drill-down operations on the static and dynamic dimensions, solutions for multi-cube analysis, and dynamic navigation of the content. We present our prototype, called DBPubs, which stores research publications as documents that can be searched and, most importantly, analyzed and explored. Finally, we present experimental results on the efficiency and effectiveness of our approach.
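The combination of keyword search with OLAP-style aggregation described in the abstract can be illustrated with a minimal sketch. This is not the MCX or DBPubs implementation: the toy corpus, the function names (`keyword_search`, `aggregate`, `dynamic_dimension`), and the use of frequent result terms as a stand-in for a discovered dynamic dimension are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy corpus; "venue" and "year" play the role of
# static dimensions taken from CMS metadata.
DOCS = [
    {"id": 1, "venue": "VLDB", "year": 2006, "text": "keyword search over xml data"},
    {"id": 2, "venue": "VLDB", "year": 2007, "text": "olap cube aggregation for text search"},
    {"id": 3, "venue": "SIGMOD", "year": 2007, "text": "xml query processing and indexing"},
    {"id": 4, "venue": "SIGMOD", "year": 2007, "text": "keyword search in relational databases"},
]

# Small illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"over", "for", "in", "and"}


def keyword_search(docs, query):
    """Return the documents whose text contains every query term."""
    terms = query.lower().split()
    return [d for d in docs if all(t in d["text"].split() for t in terms)]


def aggregate(docs, dims):
    """Count documents per combination of the given static dimensions."""
    return dict(Counter(tuple(d[dim] for dim in dims) for d in docs))


def dynamic_dimension(docs, query, top_k=3):
    """Most frequent non-query, non-stopword terms in the search result.

    A simple stand-in for a dimension discovered from the result set.
    """
    skip = STOPWORDS | set(query.lower().split())
    counts = Counter(
        term for d in docs for term in d["text"].split() if term not in skip
    )
    return [term for term, _ in counts.most_common(top_k)]


if __name__ == "__main__":
    hits = keyword_search(DOCS, "search")
    # Drill-down view: counts per (venue, year).
    print(aggregate(hits, ["venue", "year"]))
    # Roll-up view: drop "year" and count per venue only.
    print(aggregate(hits, ["venue"]))
    # Candidate values for a dynamic dimension over the same result.
    print(dynamic_dimension(hits, "search"))
```

Here roll-up is simply re-aggregation over fewer dimensions; a real system would more likely precompute or index these aggregates rather than rescan the result set.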
Similar resources
DBPubs: multidimensional exploration of database publications
DBPubs is a system for effectively analyzing and exploring the content of database publications by combining keyword search with OLAP-style aggregations, navigation, and reporting. DBPubs starts with keyword search over the content of publications. The publications’ metadata such as title, authors, venues, year, and so on, provide traditional OLAP static dimensions, which are combined with dyna...
Scale Space Exploration For Mining Image Information Content
Images are highly complex multidimensional signals, with rich and complicated information content. For this reason they are difficult to analyze with a specific automated approach. However, a hierarchical representation is helpful for understanding image content. In this paper, we describe an application of a scale-space clustering algorithm (melting) for exploration of image information conten...
Using OLAP and Data Mining for Content Planning in Natural Language Generation
We present a new approach to content determination and content organization in the context of natural language generation for quantitative database summaries. Three key properties make our work innovative and interesting: (1) we developed a new text planning approach that deals with the content organization of a data set into a summary report, for example a Data Mining discovery; (2) the approach...
The exploration of the competencies of faculty members in quality teaching
Background & Objective: The most technical and basic function of higher education is teaching and education, and providing high-quality teaching requires a certain level of competency in the main custodians of teaching in universities, i.e., the faculty members. This study aimed to evaluate the faculty members’ competencies in high-quality teaching. Materials and Methods: This was a q...
Metadata for Multidimensional Categorization and Navigation Support on Multimedia Documents
An increasing technological effort is spent on integrated representations of document collections and metadata. For instance, the emerging XML standard offers opportunities to represent metadata for, e.g., improving query and navigation support within web-based document collections. Despite this development, most applications of catalogue metaphors on the web ranging from small web site catal...
Journal:
- PVLDB
Volume 1
Publication date: 2008